Feature Allocations, Probability Functions, and Paintboxes

نویسندگان

  • Tamara Broderick
  • Jim Pitman
  • Michael I. Jordan
چکیده

The problem of inferring a clustering of a data set has been the subject of much research in Bayesian analysis, and there currently exists a solid mathematical foundation for Bayesian approaches to clustering. In particular, the class of probability distributions over partitions of a data set has been characterized in a number of ways, including via exchangeable partition probability functions (EPPFs) and the Kingman paintbox. Here, we develop a generalization of the clustering problem, called feature allocation, where we allow each data point to belong to an arbitrary, non-negative integer number of groups, now called features or topics. We define and study an “exchangeable feature probability function” (EFPF)—analogous to the EPPF in the clustering setting—for certain types of feature models. Moreover, we introduce a “feature paintbox” characterization—analogous to the Kingman paintbox for clustering—of the class of exchangeable feature models. We provide a further characterization of the subclass of feature allocations that have EFPF representations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Some New Results on Policy Limit Allocations

Suppose that a policyholder faces $n$ risks X1, ..., Xn which are insured under the policy limit with the total limit of l. Usually, the policyholder is asked to protect each Xi with an arbitrary limit of li such that ∑ni=1li=l. If the risks are independent and identically distributed with log-concave cumulative distribution function, using the notions of majorization and stochastic orderings, ...

متن کامل

A characterization of product-form exchangeable feature probability functions

We characterize the class of exchangeable feature allocations assigning probability Vn,k ∏k l=1WmlUn−ml to a feature allocation of n individuals, displaying k features with counts (m1, . . . ,mk) for these features. Each element of this class is parametrized by a countable matrix V and two sequences U and W of non-negative weights. Moreover, a consistency condition is imposed to guarantee that ...

متن کامل

Asymptotic existence of proportionally fair allocations

Fair division has long been an important problem in the economics literature. In this note, we consider the existence of proportionally fair allocations of indivisible goods, i.e., allocations of indivisible goods in which every agent gets at least her proportionally fair share according to her own utility function. We show that when utilities are additive and utilities for individual goods are...

متن کامل

Metaprogramming for the Generation of Nonparametric Curves

One of the most important functions of paintboxes is drawing curves. These primitives have been programmed and the user can never add a new program which computes the discrete points of a given function. Using metaprogramming and the Jordan’s method, our program CAPC automatically generates, for a given function, a new program which computes the discrete points for this function and adds it to ...

متن کامل

A Self-organized Multi Agent Decision Making System Based on Fuzzy Probabilities: The Case of Aphasia Diagnosis

Aphasia diagnosis is a challenging medical diagnostic task due to the linguistic uncertainty and vagueness, large number of measurements with imprecision, inconsistencies in the definition of Aphasic syndromes, natural diversity and subjectivity in test objects as well as in options of experts who diagnose the disease. In this paper we present a new self-organized multi agent system that diagno...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013